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Summary 

This  study  investigates  the  relative  ability  of  several  forecast 
methods  proposed  in  the  literature  to  predict  quarterly  EPS  in  terms 
of  the  length  of  the  forecast  horizon.   It  finds,  contrary  to  previous 
notions  of  "search  bias,"  that  individually  identified  Box  Jenkins 
models  provide  the  most  accurate  forecasts  in  certain  cases.   In 
particular,  single  models  sometimes  use  incorrect  differencing  or  omit 
a  constant  term  from  the  model.   When  this  occurs,  the  individually 
identified  model  provides  the  most  accurate  forecast. 


The  literature  is  replete  with  studies  investigating  the  predict- 
ability and  time  series  properties  of  quarterly  earnings  per  share  (EPS) . 
This  research  has  many  motivations.  1)  The  Financial  Accounting 
Standards  Board  (1977)  has  made  expected  future  earnings  a  primary 
factor  in  their  objectives  framework.   2)  The  capital  market /information 
content  studies  often  rely  on  earnings  expectation  or  prediction  models, 
and  the  use  of  inappropriate  or  misspecified  models  can  result  in 
erroneous  inferences.   3)  The  Securities  and  Exchange  Commission  has 
been  giving  serious  consideration  to  requiring  listed  firms  to  report 
earnings  forecasts  (1977).  4)  Earnings  forecasts  are  used  in  investment 
decision  making.  Norby  (1973)  found  that  99%  of  responding  financial 
analysts  used  earnings  forecasts  in  their  decision  making  process.  Also 
earnings  forecasts  are  important  to  research  relating  to  cost  of  capital 
and  dividend  policy.   (See  Foster  [1977]  for  references  on  these  issues.) 

Recent  research  has  indicated  that  the  time  series  properties  of 
quarterly  EPS  can  be  usefully  described  by  several  models  including 
those  studied  by  Brown  and  Rozeff  (1979),  Foster  (1977),  and  Griffin 
(1977)  and  Watts  (1975).   In  addition  some  of  these  authors  have  applied 
the  Box-Jenkins  (1970)  modeling  process.   (These  models  will  henceforth 
be  referred  to  as  the  BR,  F,  GW  and  BJ  models  respectively.)   These 
models  have  been  studied  further  by  Lorek  (1979)  and  Collins  and  Hopwood 
(1980)  who  investigated  their  relative  ability  to  predict  annual  EPS 
from  quarterly  EPS.   The  findings  indicate  that  the  BR  and  GW  models 
appear  to  provide  the  most  accurate  forecasts.   The  BJ  forecasts  are 
reasonably  competitive  but  not  more  accurate.   Therefore,  since  the  BJ 
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modeling  process  is  very  time  consuming,  it  has  thus  far  not  proven  its 
worth  in  this  type  of  forecasting. 

Our  motivation  for  such  an  investigation  stems  from  the  well  known 
fact  that  different  time  series  models  have  distinctly  different  fore- 
cast functions.   (For  a  detailed  discussion  see  Box  and  Jenkins  [1970, 
pp.  144-155].)   For  example,  some  models  may  produce  forecast  functions 
which  take  the  form  of  a  dampened  sine  wave  while  others  may  form  an  up- 
ward curving  line  that  eventually  levels  off  and  converges  to  the  mean 
of  the  series. 

A  particularly  important  aspect  of  forecast  functions  is  that  their 

behavior  changes  as  the  forecast  horizon  increases.   For  example,  the 

Model  y  =  <j>y   ..  +  a   (where  if)  is  a  known  parameter  which  must  be  less 
t     t— 1    t 

than  one  in  absolute  value  and  a  is  an  uncorrelated  error  term  for  time 

*        *k 
t)  has  a  forecast  function  yr+.  =  yr4>     where  k  is  the  number  of  periods 

in  the  future  for  the  forecast.  Note  that  the  first  few  forecasts  from 
this  function  form  a  curved  line,  but  at  longer  horizons  this  line  be- 
comes approximately  straight  (i.e.,  equal  to  zero).   This  means  that  the 
first  few  forecasts  will  heavily  depend  on  y  ,  but  as  the  time  horizon 
increases  the  forecasts  will  become  practically  independent  of  y  .   In 
general  for  a  sufficiently  long  horizon  the  forecast  function  will  depend 
on  the  structure  of  the  model,  and  this  shape  will  be  different  for 
different  mocels.   It  is  therefore  reasonable  to  expect  differences  in 
model  performance  as  the  forecast  horizon  increases. 

The  previous  research,  however,  has  not  included  a  systematic  study 
of  the  relative  accuracy  of  these  models  in  terms  the  length  of  the 
forecast  horizon.    The  purpose  of  the  present  study  is  to  make  such 
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an  investigation.   Specifically  we  investigate  the  possibility  that  the 
relative  performance  of  these  models  changes  as  forecasts  are  made 
farther  and  farther  into  the  future. 

The  study  is  presented  in  five  parts.  Parts  one  and  two  deal  with 
the  population  studied  and  the  statistical  modeling.  Parts  three  and 
four  present  the  findings,  and  part  five  gives  a  summary  and  conclusions. 

Population  Studied 

Data  pertaining  to  the  population  of  267  calendar  year  New  York  Stock 

Exchange  (NYSE)  listed  firms  were  obtained  from  the  Compustat  quarterly 

2 
tape.   For  a  firm  to  be  included  in  the  population,  it  was  required  to  have 

no  missing  EPS  data  for  the  64  consecutive  quarters  beginning  with  the 
first  quarter  of  1962.  This  provided  a  sample  period  from  1962  through 
1977.  The  EPS  number  used  was  primary  earnings  per  share  excluding 
extraordinary  items  and  discontinued  operations,  adjusted  for  capital 
changes. 

Note  that,  unlike  previous  research,  all  firms  which  met  the  sur- 
vivorship test  were  retained  for  analysis.  We  define  this  group  to  be 
the  population  of  interest  and  make  no  attempt  to  generalize  to  a  larger 
number  of  years  or  group  of  firms.  To  use  statistical  testing  to  make 
inferences  about  a  larger  group  of  firms  would  be  unwarranted  because 
there  is  no  reason  to  believe  that  firms  which  fail  to  meet  the 
survivorship  test  are  the  same  as  those  that  do.   In  fact,  a_  priori 
reasoning  indicates  that  firms  meeting  the  test  are  very  likely  to  be 
larger  and  older  than  the  average.   Also  attempting  to  generalize 
across  all  years  would  be  unwarranted  because  structural  changes  in  the 
economy  might  produce  a  shift  in  the  relative  performance  of  different 
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ferent  forecast  methods.   Even  if  this  were  not  a  problem,  in  order  to 
generalize  to  all  years,  it  would  be  necessary  to  obtain  a  reasonably 
large  random  sample  of  years.  This  is  not  possible  because  of  limited 
data  availability. 

Since  statistical  testing  is  used  for  making  inferences  about  a 
larger  population  and  under  the  circumstances  we  felt  that  such  infer- 
ences would  be  unwarranted,  no  statistical  tests  are  presented  in  this 
paper.   Instead,  our  goal  is  to  present  results  for  an  entire  population 
which  is  of  interest  in  its  own  right. 

Model  Estimation 

All  of  the  foregoing  models  were  estimated  for  all  of  the  population 

3 
firms.   The  years  of  1974  through  1977  were  used  as  hold-out  periods 

and  were  used  in  studying  forecast  accuracy.  Therefore,  the  267  firms 

were  each  modeled  16  times,  once  for  each  method  using  pre-1974  data 

(48  quarters  in  the  base  period)  and  again  for  each  method  (49  quarters 

in  the  base  period)  using  all  data  prior  to  the  second  quarter  of  1974, 

etc.   (The  BJ  models  were  reidentified  each  quarter.)   The  result  was 

that  each  model  made  predictions  for  16  quarters  into  the  future  (thus 

providing  a  forecast  horizon  from  1  to  16  quarters)  for  each  of  the  16 

base  periods  in  the  hold-out  period.  For  each  period  in  the 

forecast  horizon  and  each  method  the  mean  absolute  percentage  forecast 

4 
error  was  computed. 

Empirical  Findings 

Figure  1  presents  the  forecast  error  profiles  for  the  BR,  F  and  GW 
models  (we  shall  henceforth  refer  to  these  three  models  as  premier  models 
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since  in  each  case  a  single  or  premier  model  is  used  for  all  firms) .   All 
three  profiles  are  relative  to  the  BJ  model  and  in  each  case  the  forecast 
error  is  equal  to  the  error  for  the  given  model  minus  the  BJ  error. 
Therefore  the  area  below  the  dotted  zero  line  indicates  that  the  given 
model  error  is  smaller  than  the  BJ  error.  Note  that  within  the  first 
four  quarters  on  the  horizon  the  BR  forecasts  are  more  accurate  than 
those  of  BJ  for  the  first  two  quarters.   In  the  third  quarter  the  BJ 
model  is  more  accurate  while  in  the  fourth  quarter  the  two  are  about  equal. 
For  the  first  two  quarters  this  superior  performance  is  consistent  with 
the  above  cited  research  which  indicates  that  the  expensive  BJ  modeling 
process  is  not  warranted.  Note,  however,  that  the  third  quarter  is  the 
beginning  of  a  pattern  where  all  three  models  dramatically  deteriorate 
relative  to  the  BJ.  The  upward  sloping  lines  indicate  that  the  longer 
the  forecast  horizon  the  greater  the  deterioration. 

[Figure  1  about  here] 

An  Explanation  of  the  Finding 

In  order  to  explain  the  finding  we  first  consider  some  background 
on  the  BR,  F  and  GW  models  relative  to  the  BJ  type  model.   The  distin- 
guishing factor  of  the  BJ  model  is  that  the  BJ  modeling  process  selects 
a  model  for  each  firm  whereas  the  BR,  F  and  GW  models  are  single  (or 
"premier")  models  used  for  all  firms.   A  decade  or  so  ago,  when  the  BJ 
modeling  process  was  first  being  applied  in  this  area,  it  was  thought 
that  using  a  separate  model  for  each  firm  would  provide  the  most 
accurate  forecasts.   This  is  because  the  BJ  process  explicitly  models 
all  nonrandomness  in  the  data  and  the  premier  models  often  result  in 
highly  correlated  residual  errors.   Later  Foster  (1977)  proposed  that 
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selecting  a  separate  model  for  each  firm  might  result  in  a  problem  he 
termed  "search  bias."  This  means  that  in  searching  for  a  model  for 
each  firm  one  will  often  select  an  inappropriate  model  because  of  random 
variation  in  individual  firm  EPS.  As  mentioned  above,  subsequent  research 
(Collins  and  Hopwood  [1980]  and  Lorek  [1979])  supported  this  hypothesis, 
and  BJ  model  forecasts  were  not  found  to  be  more  accurate  than  those 
of  the  single  models. 

This  raises  the  question:  why  then  should  the  BJ  models  perform 
better  as  the  forecast  horizon  increases?  Our  hypothesis  is  that 
while  the  BJ  process  is  subject  to  search  bias  it  does  a  better  job 
of  determining  the  need  for  differencing  and  having  a  constant  term 
in  the  model.  We  consider  these  factors  important  because  they  largely 
determine  the  shape  of  the  forecast  function.  Under  this  hypothesis 
we  would  expect  the  premier  models  to  be  at  their  worst  (relative  to 
the  BJ)  when  they  disagree  with  the  BJ  with  respect  to  the  type  of  dif- 
ferencing and  presence  of  a  constant  term  in  the  model.   Henceforth  we 
shall  say  that  the  premier  model  has  the  same  profile  as  the  BJ  when 
it  has  the  sane  type  of  differencing  and  agrees  with  the  BJ  with  respect 
to  the  need  for  a  constant  in  the  model. 

Figures  2,  3  and  4  present  evidence  on  this  hypothesis  for  the  BR, 
F  and  GW  models.   For  each  figure  the  firms  are  partitioned  into  3  groups: 
1)  all  firms.   This  is  the  same  as  Figure  1.   2)  Those  firms  where  the 
given  premier  model  has  the  same  profile  as  the  BJ  model.   In  these  cases 
our  hypothesis  predicts  that  the  premier  model  should  be  at  its  best  rela- 
tive to  the  BJ.   3)  Those  firms  where  the  given  premier  model  has  a  profile 
which  differs  with  that  of  the  BJ.   In  these  cases  we  would  expect  the 
single  model  to  be  at  its  worst. 
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[ Figures  2,  3  and  4  about  here] 
Again  these  figures  show  the  mean  absolute  percentage  error  of  the 
premier  model  minus  the  same  error  for  the  BJ.  This  means  that  the  lower 
the  line  the  more  accurate  the  premier  model  forecasts  are  (relative  to 
BJ),  and  a  line  below  the  zero  line  (dotted  line)  indicates  that  the 
premier  model  outperforms  the  BJ.  Note  that  for  all  three  premier  models 
the  hypothesis  is  confirmed.   In  fact  the  BR  and  GW  models  actually  out- 
perform the  BJ  models  which  have  the  same  profile.  This  means  that  if 
the  BJ  model  has  an  identified  profile  equal  to  that  of  the  premier  model, 
then  the  BJ  model  does  not  produce  more  accurate  forecasts  for  longer 
time  horizons.  Looked  at  from  a  different  viewpoint,  Figures  2  and  4 
indicate  that  the  Figure  1  advantage  of  the  BJ  only  holds  when  the  iden- 
tified model  has  a  profile  different  than  that  of  the  premier  model.  The 
conclusion  is  that  premier  models  are  of  less  general  usefulness  than 
indicated  in  previous  research.  That  is,  they  should  only  be  used  when 
the  data  indicate  a  profile  consistent  with  the  given  premier  model. 

Summary  and  Conclusions 

This  study  investigated  the  relative  ability  of  several  forecast 
methods  proposed  in  the  literature  to  predict  quarterly  EPS  in  terms 
of  the  length  of  the  forecast  horizon.   It  found  something  not  indicated 
in  previous  literature,  namely  that  the  prediction  performance  of  premier 
models  drastically  deteriorates  as  the  forecast  horizon  is  increased. 
Detailed  analysis  of  the  data  indicated  that  phenomenon  is  empirically 
consistant  with  the  hypothesis  that  the  premier  models  sometimes  use 
incorrect  differencing  or  omit  a  constant  term  from  the  model.   The  re- 
sults further  indicated  that  when  the  data  show  that  the  premier  model 
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differs  from  the  BJ  model  in  terms  of  differencing  and/or  a  constant 
then  the  BJ  model  should  be  used.   This  finding  is  significant  since 
previous  research  (Brown  and  Rozeff  [1979],  Collins  and  Hopwood  [1980] 
and  Lorek  [1979])  has  indicated  that  there  is  nothing  to  be  gained  by 
doing  individual  model  identification. 
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Notes 

Brown  and  Rozeff  [1979]  looked  at  forecasts  at  one,  five,  and  nine 
quarters  into  the  future.  However,  they  did  not  explore  the  pattern  of 
forecast  errors  as  the  forecast  horizon  increases.  In  addition  they 
studied  a  very  small  sample  of  twenty  three  firms. 

vfe  required  the  firms  to  be  listed  during  the  entire  sample 
period.   The  Center  for  Security  Price  Research  (CRSP)  monthly  tape 
was  used  to  select  NYSE  listed  firms.   A  firm  was  considered  listed 
if  It  had  monthly  stock  returns  available  for  the  period  studied. 

3 
For  the  all  models  the  parameters  were  reestimated  for  each 

quarter  in  the  holdout  period.  The  BJ  models  were  also  reidentified 

each  quarter. 

4 
The  absolute  percentage  error  is  computed  as  the  average  of 


i.   Since  this  error  metric  can  be  explosive 


Actual  EPS  -  Predicted  EPS 
Actual  EPS 

when  the  denominator  approaches  zero  we  truncated  errors  in  excess  of 
ten  to  a  value  of  ten.  Truncatin  was  necessary  for  only  a  very  small 
percentage  of  the  cases. 
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